Graph Synopses, Sketches, and Streams: A Survey

نویسندگان

Sudipto Guha

Andrew McGregor

چکیده

Massive graphs arise in any application where there is data about both basic entities and the relationships between these entities, e.g., web-pages and hyperlinks; neurons and synapses; papers and citations; IP addresses and network flows; people and their friendships. Graphs have also become the de facto standard for representing many types of highly structured data. However, the sheer size of many of these graphs renders classical algorithms inapplicable when it comes to analyzing such graphs. In addition, these existing algorithms are typically ill-suited to processing distributed or stream data. Various platforms have been developed for processing large data sets. At the same time, there is the need to develop new algorithmic ideas and paradigms. In the case of graph processing, a lot of recent work has focused on understanding the important algorithmic issues. An central aspect of this is the question of how to construct and leverage small-space synopses in graph processing. The goal of this tutorial is to survey recent work on this question and highlight interesting directions for future research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

gSketch: On Query Estimation in Graph Streams

Many dynamic applications are built upon large network infrastructures, such as social networks, communication networks, biological networks and the Web. Such applications create data that can be naturally modeled as graph streams. In the graph stream model, edges of the underlying graph are received and updated sequentially in a form of a stream. It is often necessary and important to summariz...

متن کامل

Streaming Algorithms for Distributed, Massive Data Sets

Massive data sets are increasingly important in a wide range of applications, including observational sciences, product marketing, and monitoring and operations of large systems. In network operations, raw data typically arrive in streams, and decisions must be made by algorithms that make one pass over each stream, throw much of the raw data away, and produce \synopses" or \sketches" for furth...

متن کامل

An Approximate L-Difference Algorithm for Massive Data Streams

متن کامل

Sketch-Based Multi-query Processing over Data Streams

Recent years have witnessed an increasing interest in designing algorithms for querying and analyzing streaming data (i.e., data that is seen only once in a fixed order) with only limited memory. Providing (perhaps approximate) answers to queries over such continuous data streams is a crucial requirement for many application environments; examples include large telecom and IP network installati...

متن کامل

Dynamic Graphs in the Sliding-Window Model

We present the first algorithms for processing graphs in the slidingwindow model. The sliding window model, introduced by Datar et al. (SICOMP 2002), has become a popular model for processing infinite data streams in small space when older data items (i.e., those that predate a sliding window containing the most recent data items) are considered “stale” and should implicitly be ignored. While p...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

PVLDB

دوره 5 شماره

صفحات -

تاریخ انتشار 2012

Graph Synopses, Sketches, and Streams: A Survey

نویسندگان

چکیده

منابع مشابه

gSketch: On Query Estimation in Graph Streams

Streaming Algorithms for Distributed, Massive Data Sets

An Approximate L-Difference Algorithm for Massive Data Streams

Sketch-Based Multi-query Processing over Data Streams

Dynamic Graphs in the Sliding-Window Model

عنوان ژورنال:

اشتراک گذاری